Valency Theory in a Stratificational MT-System

Author

  • Paul Schmidt
Abstract

This paper tries to investigate valency theory as a linguistic tool in machine translation. There are three main areas in which major questions arise: (1) Valency theory itself. I sketch a valency theory in linguistic terms which includes the discussion of the nature of dependency representation as an interface for semantic description. (2) The dependency representation in the translation process. I try to sketch the different roles of dependency representation in analysis and generation. (3) The implementation of valency theory in an MT system. I give a few examples for how a valency description could be implemented in the EUROTRA formalism.

0. Introduction

This paper tries to apply a piece of established linguistics, known as valency theory, to the problem of machine translation. As such, it is meant to fit into the forthcoming EUROTRA MT system, though it does not deal with EUROTRA problems specifically. There are two aspects which play a role in the building of an MT system: (1) the development of linguistic specifications and (2) the development of a formalism which allows for the implementation of linguistic results. This paper mainly deals with the first aspect and thus relates not only to EUROTRA but also at least to all stratificational systems, i.e. systems that break up the translation process into a sequence of simpler translation processes. Furthermore, it relates at least to any system which uses dependency/valency information, as e.g. LFG does with its functional structure. In EUROTRA, the level where information about dependency/valency is used is the ERS (Eurotra Relational Structure), which lies between the constituent structure (ECS) and the semantic interface structure (IS). So, in EUROTRA terms, I try here to give a kind of ERS definition in the language of empirical linguistics without touching the formalism itself.

The investigation divides into three parts: (1) the sketch of a valency theory which comprises the following points: the informal definition of the concepts valency, complement, and adjunct, thereby trying to give a definition which holds for verbs, adjectives, nouns, and prepositions; the operationalization of the complement-adjunct distinction derived from this definition; the classification of the complements; a subclassification of the complement classes (C-classes) according to their syntactic realization; the determination of the relevant sentence patterns; and a short discussion of the relation of the dependency level to the constituency level and from the dependency level to semantic representation. (2) the application of the linguistic specifications to the problem of MT, which has to investigate the role of dependency representation (D-representation) (or rather the role of the transitions to D-representation, since levels do not "play a role") in analysis and generation. (3) the way the gained linguistic information can be implemented according to the EUROTRA formalism.

1. Outline of a Syntactic Valency Theory

1.1 Definition of Concepts

The syntactic valency of an element of a word class (a nonterminal category) is its property to bind a certain number and a certain kind of syntagmas. Those valency-bound syntagmas are the complements. The syntagmas which are not valency-bound are the adjuncts. All syntagmas which are obligatory in a syntactical sense are valency-bound, i.e. the sentence must contain that item, otherwise it would not be complete. Besides the obligatory syntagmas, the wordsubclass-specific elements are valency-bound (Engel 1982). Wordsubclass-specific elements are those syntagmas which can occur only with elements of wordsubclasses.

1.2 Operationalization of valency

This point usually forms the major part of each paper on valency theory and mostly consists of a discussion of various proposed tests in the literature on valency theory showing their deficiencies and coming up with a new proposal.
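
The definition of complements and adjuncts given above can be illustrated with a small sketch. It is not the paper's EUROTRA implementation: the lexicon entries, the slot labels, and the names `Slot`, `LEXICON`, `is_valency_bound`, and `classify` are all hypothetical and chosen only for illustration. The sketch encodes the two criteria from the text: a syntagma counts as valency-bound if it is obligatory or if it is specific to a subclass of the word class.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Slot:
    """One syntagma type a governor can bind (hypothetical structure)."""
    kind: str                # label of the syntagma, e.g. "object NP"
    obligatory: bool         # the sentence is incomplete without it
    subclass_specific: bool  # occurs only with a subclass of the word class


# Toy valency lexicon: illustrative entries, not taken from the paper.
LEXICON = {
    "put": [
        Slot("subject NP", obligatory=True, subclass_specific=False),
        Slot("object NP", obligatory=True, subclass_specific=True),
        Slot("directional PP", obligatory=True, subclass_specific=True),
    ],
    "eat": [
        Slot("subject NP", obligatory=True, subclass_specific=False),
        Slot("object NP", obligatory=False, subclass_specific=True),
    ],
}


def is_valency_bound(slot: Slot) -> bool:
    """Apply the two criteria: obligatory or wordsubclass-specific."""
    return slot.obligatory or slot.subclass_specific


def classify(governor: str, kind: str) -> str:
    """Return 'complement' for a valency-bound syntagma, else 'adjunct'."""
    for slot in LEXICON.get(governor, []):
        if slot.kind == kind and is_valency_bound(slot):
            return "complement"
    # Anything not listed as valency-bound for this governor is an adjunct.
    return "adjunct"


if __name__ == "__main__":
    print(classify("eat", "object NP"))    # optional but subclass-specific
    print(classify("eat", "temporal PP"))  # not in the frame of "eat"
```

In this toy lexicon the object of "eat" is optional yet still a complement, because it is marked as subclass-specific, while a temporal phrase that can combine with any verb falls outside the frame and is treated as an adjunct.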

Similar resources

Annotation Lexicons: Using the Valency Lexicon for Tectogrammatical Annotation

We present a formalization of the valency theory (Panevová, 1974) that fits the stratificational representation scheme used in the Prague Dependency Treebank. The notion of a lexicon as a repository of “static” (invariable, or context-independent) source of information is formally presented; a different type of lexicon is used at every layer of sentence representation, with a formal link to thi...

Verbal Valency in the MT Between Related Languages

The paper analyzes the differences in verbal valency frames between two related Slavic languages, Czech and Russian, with regard to their role in a machine translation system. The valency differences are a frequent source of translation errors. The results presented in the paper show that the number of substantially different valency frames is relatively low and that a bilingual valency diction...

The need for MT-oriented versions of Case and Valency in MT

This paper looks at the use in machine translation systems of the linguistic models of Case and Valency. It is argued that neither of these models was originally developed with this use in mind, and both must be adapted somewhat to meet this purpose. In particular, the traditional Valency distinction of complements and adjuncts leads to conflicts when valency frames in different languages are c...

VALENCY and MT: Recent Developments In The METAL System

This paper describes a valency model, developed within the Belgian METAL project, aimed at enhancing the modularity and multilinguality of the METAL system. The introduction provides background, section 1 discusses the existing valency framework, and section 2 presents the alternative model. The final section deals with some results and problems with this model.

Incorporation of a Valency Lexicon into a TectoMT Pipeline

In this paper, we focus on the incorporation of a valency lexicon into the TectoMT system for the Czech-Russian language pair. We demonstrate valency errors in MT output and describe how the introduction of a lexicon influenced the translation results. Though there was no impact on the BLEU score, manual inspection of concrete cases showed some improvement.

Publication date: 1986